Generalized Thompson sampling for sequential decision-making and causal inference
نویسندگان
چکیده
*Correspondence: [email protected]; [email protected] 1GRASP Laboratory, Electrical and Systems Engineering Department, University of Pennsylvania, Philadelphia, PA 19104, USA 2Max Planck Institute for Biological Cybernetics and Max Planck Institute for Intelligent Systems, Speemanstrasse 38, Tübingen 72076, Germany Abstract Purpose: Sampling an action according to the probability that the action is believed to be the optimal one is sometimes called Thompson sampling.
منابع مشابه
Erratum to: Generalized Thompson sampling for sequential decision-making and causal inference
*Correspondence: [email protected]; [email protected] 1GRASP Laboratory, Electrical and Systems Engineering Department, University of Pennsylvania, Philadelphia, PA 19104 USA 2Max Planck Institute for Biological Cybernetics and Max Planck Institute for Intelligent Systems, Speemanstrasse 38, Tübingen 72076 Germany Decisions in the presence of latent variables We correct errors in e...
متن کاملContext-dependent decision-making: a simple Bayesian model.
Many phenomena in animal learning can be explained by a context-learning process whereby an animal learns about different patterns of relationship between environmental variables. Differentiating between such environmental regimes or 'contexts' allows an animal to rapidly adapt its behaviour when context changes occur. The current work views animals as making sequential inferences about current...
متن کاملGeneralized interval-valued intuitionistic fuzzy Hamacher generalized Shapley Choquet integral operators for multicriteria decision making
The interval-valued intuitionistic fuzzy set (IVIFS) which is an extension of the Atanassov’s intuitionistic fuzzy set is a powerful tool for modeling real life decision making problems. In this paper, we propose the emph{generalized interval-valued intuitionistic fuzzy Hamacher generalized Shapley Choquet integral} (GIVIFHGSCI) and the emph{interval-valued intuitionistic fuzzy Hamacher general...
متن کاملCost Analysis of Acceptance Sampling Models Using Dynamic Programming and Bayesian Inference Considering Inspection Errors
Acceptance Sampling models have been widely applied in companies for the inspection and testing the raw material as well as the final products. A number of lots of the items are produced in a day in the industries so it may be impossible to inspect/test each item in a lot. The acceptance sampling models only provide the guarantee for the producer and consumer that the items in the lots are acco...
متن کاملLearning and Optimization for Sequential Decision Making 02 / 01 / 16 Lecture 4 : Thompson Sampling ( part 1 )
Consider the problem of learning a parametric distribution from observations. A frequentist approach to learning considers parameters to be fixed, and uses the data learn those parameters as accurately as possible. For example, consider the problem of learning Bernoulli distribution’s parameter ( a random variable is distributed as Bernoulli(μ) is 1 with probability μ and 0 with probability 1 −...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CASM
دوره 2 شماره
صفحات -
تاریخ انتشار 2014